PRODUCTION OF POLYKETIDES IN BACTERIA AND YEAST

This application claims priority under 35 USC 119 from provisional
application 60/033,193 filed 18 December 1996. The contents of this provisional
application are incorporated herein by reference.

Technical Field

The invention relates to production of polyketides in microbial hosts such as
yeast and E. coli and to preparation of libraries containing a variety of functional
polyketide synthases (PKSs) and the resulting variety of polyketides. More
specifically, it concerns supplying portions of the polyketide synthase systems on
separate vectors for simplicity in mixing and matching these portions to create a
variety of PKS resultants. This permits production of libraries of polyketide
synthases and polyketides through a combinatorial approach rather than manipulation
focused on a single production system.

Background Art

Polyketides represent a singularly useful group of natural products which are
related by their general pathway of biosynthesis. Representative members include the
macrolide antibiotics, for example, erythromycin, spiramycin and tylosin,
immunosuppressants such as rapamycin and FK506, antiparasitics such as the
avermectins, antifungal agents such as amphotericin B and nystatin, anticancer agents
such as daunorubicin and doxorubicin and anticholesterolemics such as mevinolin.
Polyketides generally are secondary metabolites of the actinomycetes including the
genera Streptomyces, Actinomyces, Actinomadura, Micromonospora,
Saccharopolyspora, and Nocardia. It was estimated that in 1986 about 6,000
antibiotics of microbial origin had been characterized of which 70 were in clinical
use; an additional 1100 metabolites were reported between 1988 and 1992,
approximately 40% of which were polyketides.

Despite the multiplicity of polyketide structures available from nature, there
remains a need to expand the repertoire of available polyketides and to synthesize a
multiplicity of polyketides in the form of libraries so that there is a convenient
substrate for screening to identify polyketides that are relevant to a specific target of
interest. The present invention provides solutions to these needs.

Polyketides generally are synthesized by condensation of two-carbon units in a
manner analogous to fatty acid synthesis. In general, the synthesis involves a starter
unit and extender units; these "two-carbon" units are derived from acylthioesters,
typically acetyl, propionyl, malonyl or methylmalonyl coenzyme-A thioesters. There
are two major classes of polyketide synthases (PKSs) which differ in the "manner" in
which the catalytic sites are used: the so-called "aromatic" PKS and the modular
PKS. The present invention employs coding sequences from both these classes as
will further be explained in the herein application.

Recombinant production of heterologous functional PKS (i.e., a PKS which
is capable of producing a polyketide) has been achieved in Streptomyces, and hybrid
forms of aromatic PKSs have been produced in these hosts as well. See, for example,
Khosla, C. et al. J Bacteriol (1993) 175:2194-2204; Hopwood, D.A. et al. Nature
(1985) 314:642-644; Sherman, D.H. et al. J Bacteriol (1992) 174:6184-6190. In
addition, recombinant production of modular PKS enzymes has been achieved in
Streptomyces as described in PCT application WO 95/08548. In all of these cases, the
PKS enzymes have been expressed from a single vector. A single vector which
carried genes encoding PKS catalytic sites was transformed into E. coli by Roberts,
G.A., et al. Eur J Biochem (1993) 214:305-311, but the PKS was not functional,
presumably due to lack of pantothenoylation of the acyl carrier proteins.

The present invention provides double or multivector systems for production
of PKS and the resultant polyketides in a variety of hosts. The use of multiple vectors
provides a means to enhance more efficiently the number of combinatorial forms of
PKS and polyketides that can be prepared. Addition of the machinery for
pantothenoylation of the acyl carrier proteins (i.e., a holo ACP synthase) permits
production of polyketides in a wide spectrum of hosts.

Disclosure of the Invention 

The invention relates to recombinant materials for the production of
polyketides in a wide variety of hosts and of libraries of PKS enzymes and the
resultant polyketides based on a multiple vector system. The use of a multivector
system facilitates the construction of combinatorial libraries and permits more
flexibility in designing various members thereof. The invention also relates to such
libraries which are essentially self-screening due to an autocrine system involving
polyketide-responsive receptors.

Thus, in one aspect, the invention relates to a recombinant host cell and
libraries thereof when the host cell is modified to contain at least two vectors, a first
vector containing a first selection marker and a first expression system and the second
vector containing a second selection marker and a second expression system and
optionally additional vectors containing additional selectable markers and expression
systems, wherein the expression systems contained on the vectors encode and are
capable of producing at least a minimal PKS system. If the minimal PKS system is an
aromatic system, the minimal system will comprise a ketosynthase/acyl transferase
(KS/AT) catalytic region, a chain length factor (CLF) catalytic region and an acyl
carrier protein (ACP) activity. If the minimal PKS system is a modular system, the
system will contain at least a KS catalytic region, an AT catalytic region, and an ACP
activity. For modular systems, these activities are sufficient provided intermediates in
the synthesis are provided as substrates; if de novo synthesis is to be required, a
loading acyl transferase should be included, which will include another AT and ACP
region.
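
The minimal-system requirements just described can be restated as a simple check over the activities pooled from all vectors in a host cell. The Python sketch below is illustrative only: the function name `is_minimal_pks` and the string labels used for the activities are hypothetical names introduced here, not part of the disclosure.

```python
# Illustrative sketch; activity labels and the function name are hypothetical.
AROMATIC_MINIMAL = {"KS/AT", "CLF", "ACP"}
MODULAR_MINIMAL = {"KS", "AT", "ACP"}
# Extra activities needed by a modular system for de novo synthesis.
LOADING = {"loading AT", "loading ACP"}


def is_minimal_pks(activities, system, de_novo=False):
    """Return True if the pooled activities form at least a minimal PKS."""
    present = set(activities)
    if system == "aromatic":
        return AROMATIC_MINIMAL <= present
    if system == "modular":
        required = MODULAR_MINIMAL | (LOADING if de_novo else set())
        return required <= present
    raise ValueError(f"unknown PKS system: {system}")


# Activities encoded on three separate vectors, pooled for one host cell.
vectors = [{"KS/AT"}, {"CLF"}, {"ACP"}]
pooled = set().union(*vectors)
print(is_minimal_pks(pooled, "aromatic"))
```

The check treats the host cell as the union of whatever its vectors encode, which mirrors the idea that the minimal system may be split across two or more vectors.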

In one specific embodiment of this aspect of the invention, the recombinant
host cell will be modified to contain: (a) a first vector comprising a first selectable
marker and an expression system comprising a nucleotide sequence encoding a
ketosynthase/acyl transferase (KS/AT) catalytic region of an aromatic PKS operably
linked to a promoter operable in said cell; (b) a second vector comprising a second
selectable marker and an expression system comprising a nucleotide sequence
encoding a chain length factor (CLF) catalytic domain operably linked to a promoter
operable in said cell; and (c) a third vector containing a third selectable marker and an
expression system which comprises a nucleotide sequence encoding an acyl carrier
protein (ACP) activity operably linked to a promoter operable in said cell, and to
libraries comprised of colonies of such cells. Alternatively, two of the vectors can be
combined so that the host cell contains only two vectors; the vector containing two
expression systems may maintain these as separate expression systems or two open
reading frames may be placed under the control of a single promoter.

In another specific embodiment, the invention relates to a cell modified to
contain a first vector containing a first selectable marker and an expression system for
at least one minimal module of a modular polyketide synthase (PKS) operably linked
to a promoter operable in said cell, and a second vector containing a second selectable
marker and a nucleotide sequence encoding at least a second minimal module of a
modular polyketide synthase operably linked to a promoter operable in said cell, and
to libraries comprising colonies of such cells.

In another variation, one or more expression systems for a defined portion of a
PKS system is integrated into the host chromosome and at least one additional
expression system resides on a replicable vector. Thus, in the case of aromatic PKS,
an expression system for one of the open reading frames may first be integrated into
the chromosome and expression systems for other open reading frames may reside on
vectors. In the case of a modular PKS, an expression system for one or more modules
may reside on the chromosome and additional expression systems for one or more
modules reside on vectors. The integration of such expression systems into the
chromosome can occur either through known phage-mediated integration or by
homologous recombination.

The invention also is directed to novel polyketides produced by the methods of 
the invention and to methods to screen the polyketide libraries obtained. 

In still another aspect, the invention is directed to methods to obtain the
synthesis of polyketides in hosts that lack a mechanism for activation of the acyl
carrier proteins, i.e., which lack holo ACP synthases. By supplying an expression
system for a compatible holo ACP synthase either on a separate vector, on one of the
vectors in a multiple vector system (or on a single vector for PKS expression), or as a
fusion protein with a PKS or portion thereof, hosts such as E. coli, yeast, and other
microbial systems which do not customarily synthesize polyketides can be made into
convenient hosts. This obviates the necessity for supplying "clean" hosts from
polyketide-producing strains of, for example, Streptomyces.



Brief Description of the Drawings

Figure 1 is a diagram showing the composition of several typical aromatic
PKS.

Figure 2 is a diagram showing the organization of erythromycin PKS as
typical of a modular PKS.

Figure 3 is a diagram showing the organization of the fungal PKS system,
6-methyl salicylic acid synthase (6-MSAS).

Figure 4 is a diagram which shows the conceptualization of a multivectored
modular PKS system.

Figure 5 is a diagram of a multivectored aromatic PKS system.

Figure 6 shows, diagrammatically, the construction of a vector for expression
of a holo-ACP synthase and a vector for the expression of the gene encoding
6-MSAS, both vectors for use in yeast.

Figure 7 shows the results of HPLC run on supernatants of yeast cultures
transformed with various vectors of the invention.

Figures 8A and 8B show the kinetics of production of the antibiotic 6-methyl
salicylic acid (6-MSA) in yeast (Figure 8A) and in E. coli (Figure 8B).

Figure 9 shows the expression systems for two modular PKS for use in vectors
compatible with yeast along with the expected products.

Modes of Carrying Out the Invention

The invention in various aspects employs various components of the aromatic
PKS system, the modular PKS system, a fungal PKS system, or modified forms
thereof or portions of more than one of these. The general features of aromatic,
modular and fungal PKS systems are shown in Figures 1, 2 and 3 respectively.

"Aromatic" PKS systems are characterized by the iterative use of the catalytic
sites on the several enzymes produced. Thus, in aromatic PKS systems, only one
enzyme with a specific type of activity is produced to catalyze the relevant activity for
the system throughout the synthesis of the polyketide. In aromatic PKS systems, the
enzymes of the minimal PKS are encoded in three open reading frames (ORFs). As
shown in Figure 1, the actinorhodin PKS is encoded in six separate ORFs. For the
minimal PKS, one ORF contains a ketosynthase (KS) and an acyltransferase (AT); a
second ORF contains what is believed to be a chain-length factor (CLF); and a third
reading frame encodes an acyl carrier protein (ACP). Additional ORFs encode an
aromatase (ARO), a cyclase (CYC), and a ketoreductase (KR). The combination of a
KS/AT, ACP, and CLF constitutes a minimal PKS, since these elements are necessary
for a single condensation of a two-carbon unit.

On the other hand, the gris PKS contains five separate ORFs wherein the
KS/AT, CLF, and ACP are on three ORFs, the KR is on a fourth, and the ARO is on a
fifth.

In the "modular" PKS systems, each catalytic site is used only once and the
entire PKS is encoded as a series of "modules." Thus, the modular synthase protein
contains a multiplicity of catalytic sites having the same type of catalytic activity. A
minimal module contains at least a KS, an AT and an ACP. Optional additional
activities include KR, DH, an enoylreductase (ER) and a thioesterase (TE) activity.
Figure 2 shows, diagrammatically, the organization of the modular PKS system for
the synthesis of the immediate precursor, 6-dEB, for the antibiotic erythromycin. As
shown, there is a loading region followed by six modules; the thioesterase on module
6 effects release of the completed 6-deoxyerythronolide B (6-dEB) from the synthase
to which it is coupled through a phosphopantetheinyl group. The diagram shows the
progressive formation of the 6-dEB which is cyclized after removal from the holo
ACP on module 6 of the synthase. To convert 6-dEB to erythromycin A, two sugar
residues are added in subsequent reactions through the hydroxyl groups at positions 3
and 5.

The "fungal" PKS encoding 6-methyl salicylic acid synthase (6-MSAS) has
some similarity to both the aromatic and modular PKS. It has only one reading frame
for KS, AT, a dehydratase (DH), KR and ACP. Thus, it looks similar to a single
module of a modular PKS. These sites are, however, used iteratively. Unlike an
aromatic PKS, it does not include a CLF, as shown in Figure 3.

The invention herein employs expression systems for the catalytic activities
involved in all of the aromatic, modular and fungal PKS systems. The proteins
produced may contain the native amino acid sequences and thus the substrate
specificities and activities of the native forms, or altered forms of these proteins may
be used so long as the desired catalytic activity is maintained. The specificity and
efficiency of this activity may, however, differ from that of the native forms. Certain
activities present in the native system, however, can be intentionally deleted. Further,
components of various aromatic systems can be mixed and matched, as well as can
components of various modules of the modular systems. PCT application
WO 95/08548, referenced above and incorporated herein by reference, describes the
construction of hybrid aromatic PKS systems where, for example, open reading
frames of actinorhodin are included in expression vectors with open reading frames
from alternative aromatic systems.

Expression systems for the PKS proteins alone may not be sufficient for actual
production of polyketides unless the recombinant host also contains holo ACP
synthase activity which effects pantothenoylation of the acyl carrier protein. This
activation step is necessary for the ability of the ACP to "pick up" the unit
which is the starter unit or the growing polyketide chain in the series of Claisen
condensations which result in the finished polyketide. For hosts lacking a
phosphopantothenoylating enzyme that behaves as a holo ACP synthase, the
invention provides means for conferring this activity by supplying suitable expression
systems for this enzyme. The expression system for the holo ACP synthase may be
supplied on a vector separate from that carrying a PKS unit or may be supplied on the
same vector or may be integrated into the chromosome of the host, or may be supplied
as an expression system for a fusion protein with all or a portion of a polyketide
synthase. In general, holo ACP synthases associated with fatty acid synthesis are not
suitable; rather, synthases associated specifically with polyketide synthesis or with
synthesis of nonribosomal proteins are useful in this regard.

Specifically, the modular and fungal PKS systems are not activated by
phosphopantothenoylation effected by the phosphopantothenoylation enzymes
indigenous to E. coli; however, enzymes derived from Bacillus, in particular the
gramicidin holo ACP synthase of Bacillus brevis and the surfactin-related holo-ACP
synthase from Bacillus subtilis, can utilize the modular and fungal PKS ACP domains
as substrates. As shown in the Examples below, while inclusion of an expression
system for an appropriate holo-ACP synthase is not necessary for just the expression
of the genes encoding fungal or modular PKS in E. coli or yeast, inclusion of such
expression systems is required if polyketides are to be produced by the enzymes
produced.

It should be noted that in some recombinant hosts, it may also be necessary to
activate the polyketides produced through postsynthesis modifications when
polyketides having antibiotic activity are desired. If this is the case for a particular
host, the host will be modified, for example by transformation, to contain those
enzymes necessary for effecting these modifications. Among such enzymes, for
example, are glycosylation enzymes.

WO 98/27203 PCT/US97/23014

The combinatorial possibilities for synthesis of aromatic PKS systems depend
on the nature of the iteratively used sites and the presence or absence of the optional
activities that are not part of the minimal PKS system required for the Claisen
condensation which represents the synthetic mechanism for the end-product
polyketide. Thus, while the aromatic PK synthase must contain a KS/AT, ACP and
CLF, the other catalytic activities, i.e., KR, ARO, and CYC, are optional. Fungal PK
synthases require only KS, AT, and ACP functionalities, as do the modular PKS
systems. Various combinations of these activities from various sources can be used as
well as their mutated forms.

Because the catalytic sites are used only once in the modular PKS systems, the
combinatorial possibilities in this type of synthase are greater. The combinatorial
potential of a modular PKS is given by $AT_L \times (AT_E \times 4)^M$, where $AT_L$ is the
number of loading acyl transferases, $AT_E$ is the number of extender acyl
transferases, and $M$ is the number of modules in the gene cluster. The number 4 is
present in the formula because this represents the number of ways a keto group can
be modified: 1) no reaction; 2) KR activity alone; 3) KR+DH activity; or
4) KR+DH+ER activity.
It has been shown that expression of only the first two modules of the erythromycin
PKS resulted in the production of a predicted truncated triketide product. See Kao, et
al. J Am Chem Soc (1994) 116:11612-11613. A novel 12-membered macrolide
similar to methymycin aglycone was produced by expression of modules 1-5 of this
PKS in S. coelicolor. See Kao, C. et al. J Am Chem Soc (1995) 117:9105-9106. This
work, as well as that of Cortes, J. et al. Science (1995) 268:1487-1489, shows that
PKS modules are functionally independent so that lactone ring size can be controlled
by the number of modules present.
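
As a worked illustration of the combinatorial formula, the short Python sketch below evaluates the product of the number of loading acyl transferases and the per-module choices raised to the number of modules. The function name `combinatorial_potential` and the example inputs (one loading AT, two extender ATs, six modules) are assumptions chosen for illustration, not values prescribed by the text.

```python
# Number of ways a keto group can be processed in a module:
# no reaction, KR alone, KR+DH, or KR+DH+ER.
KETO_OPTIONS = 4


def combinatorial_potential(at_loading, at_extender, modules):
    """Evaluate AT_L x (AT_E x 4)^M for a modular PKS."""
    return at_loading * (at_extender * KETO_OPTIONS) ** modules


# Hypothetical inputs: 1 loading AT, 2 extender ATs, 6 modules.
print(combinatorial_potential(1, 2, 6))  # 1 * 8**6 = 262144
```

Because the module count appears as an exponent, adding a module multiplies the potential by the number of per-module choices, whereas adding a loading AT only scales it linearly.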

In addition to controlling the number of modules, the modules can be
genetically modified, for example, by the deletion of a ketoreductase domain as
described by Donadio, S. et al. Science (1991) 252:675-679; Donadio, S. et al. Gene
(1992) 115:97-103. In addition, the mutation of an enoyl reductase domain was
reported by Donadio, S. et al. Proc Natl Acad Sci USA (1993) 90:7119-7123. These
modifications also resulted in modified PKS and thus modified polyketides.

As stated above, in the present invention, the coding sequences for catalytic
activities derived from the aromatic, fungal or modular PKS systems found in nature
can be used in their native forms or modified by standard mutagenesis techniques to
delete or diminish activity or to introduce an activity into a module in which it was
not originally present. For example, a KR activity can be introduced into a module
normally lacking that function.

While the art, as set forth above, has succeeded in producing some novel
polyketides by virtue of construction of hybrid and/or altered aromatic or modular
PKS in Streptomyces from a single expression vector, advantage has not been
taken of using a multiple vector system in host cells generally in order to produce a
wider variety of synthases. By "multiple" is meant two or more; by "vector" is meant
a nucleic acid molecule which can be used to transform host systems and which
contains both a selectable marker and an independent expression system containing a
coding sequence under control of a promoter and any other suitable sequences
regulating expression. Typical such vectors are plasmids, but other vectors such as
phagemids, cosmids, viral vectors and the like can be used according to the nature of
the host.

Of course, one or more of the separate vectors may result in integration of the 
relevant expression systems into the chromosome of the host. 

Neither have microbial hosts generally, such as E. coli and yeast, been used
successfully to construct polyketides. It is believed that this is due to the lack of holo
ACP synthase which, according to the methods of the invention, can be supplied to
these hosts.

Thus, in order to produce the polyketides of the invention, suitable hosts are
modified to contain vectors, typically plasmids, which contain expression systems for
the production of proteins with one or more of the activities associated with PKS. By
placing various activities on different expression vectors, a high degree of variation
can be achieved. A variety of hosts can be used; any suitable host cell for which
selection markers can be devised to assure the incorporation of multiple vectors can
readily be used. Preferred hosts include yeast, E. coli, actinomycetes, and plant cells,
although there is no theoretical reason why mammalian or insect cells or other
suitable recombinant hosts could not be used. Preferred among yeast strains are
Saccharomyces cerevisiae and Pichia pastoris. Preferred actinomycetes include
various strains of Streptomyces.

The choice of hosts, of course, dictates the choice of the control sequences
associated with the expression system as well as the selectable markers. Suitable
promoter systems, for example, for use in E. coli include the tryptophan (trp)
promoter, the lactose (lac) promoter, the T7 promoter and the λ-derived PL promoter
and N-gene ribosome binding site. For yeast, suitable control sequences include
promoters for the synthesis of glycolytic enzymes, such as 3-phosphoglycerate kinase.
Other promoters include those for alcohol dehydrogenase (ADH-1 and ADH-2),
isocytochrome-C, acid phosphatase, degradative enzymes associated with nitrogen
metabolism and enzymes responsible for maltose and galactose utilization. It is also
believed that terminator sequences are desirable at the 3' end of the coding sequences.

Suitable promoters for use in mammalian cells, actinomycetes, plant cells,
insect cells and the like are also well known to those in the art.

Selectable markers suitable for use in bacteria such as E. coli and
actinomycetes generally impart antibiotic resistance; those for use in yeast often
complement nutritional requirements. Selectable markers for use in yeast include, but
are not restricted to, URA3, LEU2-d, TRP1, LYS2, HIS1, HIS3. Selectable markers for
use in actinomycetes include, but are not restricted to, those for thiostrepton-,
apramycin-, hygromycin-, and erythromycin-resistance.

Methods and materials for construction of vectors, transformation of host cells 
and selection for successful transformants are well understood in the art. 

Thus, according to one embodiment of the invention herein, a single host cell
will be modified to contain a multiplicity of vectors, each vector contributing a
portion of the synthesis of a PKS system. In constructing multiple vectors for
production of aromatic PKS systems, the separate reading frames such as those shown
in Figure 1 may be incorporated on separate vectors or, if properly constructed,
portions of reading frames can be distributed among more than one vector, each with
appropriate sequences for effecting control of expression. For modular systems, a
single module or more than one module may reside as a part of an expression system
on a single vector; multiple vectors are used to modify the cell to contain the entire
desired PKS system.




As stated above, one or more of the expression systems introduced into the 
host may be integrated into the chromosome. 

Thus, to prepare the libraries of the invention, suitable host cells are
transformed with the desired number of vectors; by using different selectable markers
on each vector desired as part of the modification, successful transformants which are
modified by inclusion of all the desired vectors can be selected. By using a mixture of
a first vector with a first selectable marker containing a multiplicity of expression
systems for a portion of a PKS, and a mixture of a second vector with
expression systems for a variety of a second portion of a PKS system, and so forth,
colonies of successful transformants are obtained that have a combinatorial
representation of "hybrid" PKS systems. By preparing panels of individual colonies
of such successful transformants, a library of PKS systems is obtained and thereby a
library of polyketides. An expression system for holo ACP synthase is also supplied
if needed. The polyketides may be glycosylated depending on the nature of the host.
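
The combinatorial representation obtained by mixing vector pools can be pictured as a Cartesian product over the pools, one pool per selectable marker. The Python sketch below is a bookkeeping illustration under stated assumptions: the pool contents (`KS/AT-a`, `CLF-b`, and so on) are placeholder names, the pairing of each pool with a yeast marker is hypothetical, and the helper `enumerate_library` is introduced here for illustration only.

```python
from itertools import product

# Hypothetical pools: each pool shares one selectable marker and carries
# variants of one portion of the PKS. All variant names are placeholders.
pools = {
    "URA3": ["KS/AT-a", "KS/AT-b"],
    "LEU2-d": ["CLF-a", "CLF-b", "CLF-c"],
    "TRP1": ["ACP-a", "ACP-b"],
}


def enumerate_library(pools):
    """List every transformant genotype that carries one vector per marker."""
    markers = list(pools)
    return [dict(zip(markers, combo)) for combo in product(*pools.values())]


library = enumerate_library(pools)
print(len(library))  # 2 * 3 * 2 = 12 possible hybrid PKS systems
```

Selecting on all markers at once corresponds to keeping only genotypes with exactly one entry per pool, which is why the library size is the product of the pool sizes.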

This approach can also be modified by effecting the integration of the
appropriate portion of one or more of the multiple vectors into the chromosome of the
host. Integration can be effected using suitable phage vectors or by homologous
recombination. If homologous recombination is used, the integration may also delete
endogenous PKS activity ordinarily residing in the chromosome, as described in the
above-cited PCT application WO 95/08548. In these embodiments, too, a selectable
marker such as hygromycin or thiostrepton resistance will be included in the vector
which effects integration.

The libraries of polyketides can then be screened for activity with respect to
any polyketide responsive target in order to identify particular polyketide members
that will activate or otherwise bind to the target. Such screening methods are standard
in the art.

In a particularly preferred embodiment of the invention, the library can be
made self-screening by introducing a polyketide-responsive receptor that is
intracellular to or is displayed at the surface of the host cell producing the polyketide
itself. This "autocrine" system allows the colonies to self-select for those activating
the receptor. Such systems are described, for example, in an article by Broach, J.R.
and Thorner, J., Nature (1996) 384:Supp. 7:14-16.

Autocrine systems need not be limited, however, to receptors, but can include
proteins that are expressed internal to the cell and whose interaction can be evaluated
with respect to the polyketides produced, in a manner analogous to the yeast two-hybrid
system described by Fields in U.S. Patent 5,283,173.

Thus, the cells are modified to create "cell-based detection systems for
polyketide function." The function of the polyketide may include agonist or
antagonist activity with respect to a receptor which is either produced at the surface of
the cell or produced intracellularly, or the polyketides may be agonists or antagonists
for two-hybrid interaction screens so that it will be possible to select for
protein-protein interaction inhibitors or cross-linking factors analogous to rapamycin and
FK506.

It should be noted that such cell-based detection systems are also useful in
screening libraries of polyketides which are produced from cells containing only
single vector systems. Thus, these improvements are applicable not only to the
multivector combinatorial libraries of the present invention but also to polyketide
synthase and polyketide libraries produced using cells containing these systems on a
single expression vector.

As mentioned above, additional enzymes which effect posttranslational
modifications to the enzyme systems in the PKS may need to be introduced into the
host through suitable recombinant expression systems. In addition, enzymes that
activate the polyketides themselves, for example, through glycosylation, may be
needed. It may also be necessary to modify the catalytic domains to alter their
substrate specificity or to substitute domains with the appropriate specificity. For
example, it is generally believed that malonyl CoA levels in yeast are higher than
methylmalonyl CoA; if yeast is chosen as a host, it may be desirable to include
catalytic domains that can utilize malonyl CoA as an extender unit, such as those
derived from spiramycin or tylosin.

Figure 4 diagrams one embodiment of the conceptual basis of the present
invention wherein three separate vectors are employed to produce a modular PKS. As
shown, each vector permits the construction of 64 different open reading frames using
two extender ATs (one from methylmalonyl CoA and the other from malonyl CoA)
and the four combinations involving KR, DH, and ER as described above. Thus,
module No. 1 may employ malonyl CoA as an extender unit and module No. 2
methylmalonyl CoA; the opposite sequence can be used, or both extenders might use
malonyl CoA or both might use methylmalonyl CoA. This results in four separate
types of extender combinations, each of which is multiplied by the four KR/DH/ER
variants available for each module. Each separate plasmid offers the same set of
possibilities; one of the plasmids must also contain a loading function and one must
contain a thioesterase function. Thus, by construction of 192 plasmids, the upper limit
of synthesis of novel polyketides is 64x64x64 or 262,144 molecules, providing an
efficient method to obtain large numbers of novel polyketides.
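
The counts quoted for Figure 4 can be reproduced by explicit enumeration. The Python sketch below assumes two modules per vector, an assumption inferred here from the stated figure of 64 open reading frames per vector rather than stated outright in the text; the variable names are likewise introduced only for illustration.

```python
from itertools import product

EXTENDERS = ["malonyl CoA", "methylmalonyl CoA"]
KETO_PROCESSING = ["none", "KR", "KR+DH", "KR+DH+ER"]
MODULES_PER_VECTOR = 2  # assumption inferred from the count of 64
VECTORS = 3

# Each module pairs one extender AT with one keto-processing variant.
module_variants = list(product(EXTENDERS, KETO_PROCESSING))
# Each vector carries an ordered pair of modules.
orfs_per_vector = list(product(module_variants, repeat=MODULES_PER_VECTOR))

plasmids_to_build = VECTORS * len(orfs_per_vector)
pks_upper_limit = len(orfs_per_vector) ** VECTORS

print(len(module_variants), len(orfs_per_vector))
print(plasmids_to_build, pks_upper_limit)
```

The contrast between 192 plasmids constructed and 262,144 possible three-vector combinations is the practical argument for splitting the PKS across vectors: construction effort grows additively while the library grows multiplicatively.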

Figure 5 shows an approach to a multiple vector aromatic PKS that is set forth
in greater detail in Example 11 hereinbelow. In Figure 5, the three separate reading
frames of a typical aromatic polyketide synthase are placed on separate vectors.
Thus, each reading frame can be derived from a different aromatic polyketide
synthase if desired.

Another modification useful in varying the polyketides produced, regardless of
the host cell employed, manipulates the PKS, in particular a modular or fungal PKS, to
inactivate the ketosynthase (KS) on the first module. This enhances the efficiency
with which the system incorporates a suitable diketide thioester such as
3-hydroxy-2-methyl pentanoic acid-N-acetyl cysteamine thioester, or similar
thioesters of diketide analogs, as described by Jacobsen et al. Science (1997)
277:367-369. The construction of PKS modules containing inactivated ketosynthase
regions is described in copending U.S. application 08/675,817 and published in PCT
application WO 97/02358, incorporated herein by reference. These modified PKS
modules can be employed in the various embodiments of the invention in preparing
libraries using multivector methods and/or in E. coli and yeast-based production
organisms for the polyketides, which may require the additional expression of a gene
encoding a suitable holo-ACP synthase.

Thus, the present invention provides the opportunity to produce polyketides in
hosts which normally do not produce them, such as E. coli and yeast. The invention
also provides more efficient means to provide a variety of polyketide products by
supplying the elements of the introduced PKS, whether in an E. coli or yeast host or in
other more traditionally used hosts, on multiple separate vectors. The invention also
includes libraries of polyketides prepared using the methods of the invention.

Uses of Polyketides

As is well understood, the polyketides, in their glycosylated forms, are
powerful antibiotics. In addition, many polyketides are immunosuppressants and
anticancer agents. It has also been found that polyketides or their glycosylated forms
can reduce inflammation under certain circumstances. This is believed to be due to
the ability of certain antibiotics to inhibit the release of cytokines such as IL-8. For
example, Hotta, M. in the Kurume Medical Journal (1996) 43:207-217 concludes that
the favorable clinical effect of erythromycin in cryptogenic organizing pneumonia and
related conditions is due to inhibition of neutrophil accumulation in the peripheral
airways through local suppression of IL-8 production. In further experimental work,
Tamaoki, J. et al. Antimicrobial Agents and Chemotherapy (1996) 40:1726-1728
showed that pretreatment of guinea pigs with roxithromycin or erythromycin inhibited
the increase in goblet cell secretion when IL-8 was inhaled. Hamada, K. et al.
Chemotherapy (1995) 44:59-69 showed that the antitumor effect of erythromycin in
mice was due to enhancing the production of IL-4. In another study, Keicho, N. et al.,
Journal of Antibiotics (Tokyo) (1993) 46:1406-1413, state that erythromycin has been
reported to depress the extent of inflammation independent of its antimicrobial action
and show that erythromycin suppresses the proliferative response of human
lymphocytes stimulated with mitogens and antigens but had no effect on
concanavalin-A induced IL-2 production or IL-2R-α expression. Bailly, S. et al.
Antimicrobial Agents and Chemotherapy (1991) 35:2016-2019 showed that
roxithromycin, spiramycin and erythromycin have differing effects on production of
IL-1α, IL-1β and IL-6 as well as tumor necrosis factor α. Spiramycin, and to a lesser
extent, erythromycin increase total IL-6 production without affecting IL-1α, IL-1β or
TNFα. Roxithromycin had no effect.

Thus, there are a number of papers which indicate that antibiotics are also
important in modulating inflammatory mechanisms. The literature appears to show
that erythromycin diminishes the production of IL-8, but enhances the production of
IL-6, IL-1 and IL-2. Spiramycin has been shown to enhance the production of IL-6.


These examples are intended to illustrate but not to limit the invention.

Example 1

Construction of 102d, a 6-MSAS Yeast Expression Vector
Control sequences effective in yeast were obtained and inserted into plasmid
pBlueScript (Stratagene) along with a polylinker. The S. cerevisiae ADH2 promoter
was amplified by PCR using the following primers:

forward: GGGAGCTCGGATCCATTTAGCGGCCGCAAAACGTAGGGGC
reverse: CCGAATTCTAGAGGTTTCATATGGTATTACGATATAGTTAATAG.

The forward primer contains 15 bases complementary to the 5' ADH2
sequence and introduces SacI (nucleotides 3-8), BamHI (nucleotides 9-14), and NotI
(nucleotides 20-27) restriction sites. The reverse primer contains 15 bases
complementary to the 3' ADH2 sequence and introduces NdeI (nucleotides 18-23),
XbaI (nucleotides 7-12), and EcoRI (nucleotides 3-8) sites.
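
The nucleotide positions quoted for the ADH2 promoter primers can be checked against the primer sequences themselves. The sketch below is a plain string search and is offered only as a sanity check. The recognition sequences are the standard ones for the named enzymes, while the dictionary and function names are illustrative and not taken from the source.

```python
# Standard recognition sequences of the enzymes named in the text.
SITES = {
    "SacI": "GAGCTC",
    "BamHI": "GGATCC",
    "NotI": "GCGGCCGC",
    "NdeI": "CATATG",
    "XbaI": "TCTAGA",
    "EcoRI": "GAATTC",
}

# ADH2 promoter primers as printed in Example 1.
FORWARD = "GGGAGCTCGGATCCATTTAGCGGCCGCAAAACGTAGGGGC"
REVERSE = "CCGAATTCTAGAGGTTTCATATGGTATTACGATATAGTTAATAG"


def locate(primer, enzymes):
    """Map each enzyme to the 1-based inclusive (start, end) of its first site, or None."""
    found = {}
    for name in enzymes:
        idx = primer.find(SITES[name])
        found[name] = (idx + 1, idx + len(SITES[name])) if idx >= 0 else None
    return found


forward_sites = locate(FORWARD, ["SacI", "BamHI", "NotI"])
reverse_sites = locate(REVERSE, ["EcoRI", "XbaI", "NdeI"])

print(forward_sites)  # {'SacI': (3, 8), 'BamHI': (9, 14), 'NotI': (20, 27)}
print(reverse_sites)  # {'EcoRI': (3, 8), 'XbaI': (7, 12), 'NdeI': (18, 23)}
```

The positions found agree with those stated for both primers. Note that the EcoRI and XbaI sites in the reverse primer overlap by two bases (GAATTCTAGA).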

The ADH2 terminator was amplified by PCR using the following primers:

forward: GGGAATTCATAGTCGACCGGACCGATGCCTTCACGATTTATAG
reverse: TTTTCTATTATAAGATGAAAAACGAGGGGAGCTCCCATGGCC.

The forward primer introduces EcoRI (nucleotides 3-8), SalI (nucleotides 12-17),
and RsrII (nucleotides 17-24) restriction sites. The reverse primer introduces
XhoI (nucleotides 29-34) and Asp718 (nucleotides 35-40) restriction sites.

The SacI/EcoRI fragment containing the ADH2 promoter, the EcoRI/Asp718
fragment containing the ADH2 terminator, and the SacI/Asp718 fragment of
pBlueScript were ligated to produce an intermediate vector, 43d2, which contains
cloning sites (L2) for 6-MSAS and the gene for the surfactin phosphopantetheinyl
transferase from B. subtilis (the sfp gene). See Figure 6. It also contains sites (L1,
L3) for transferring the promoter/terminator cassette into yeast shuttle vectors as well
as sites (L1, L2) for moving the promoter/gene cassettes from the intermediate
BlueScript vector into the yeast shuttle vector.

The ADH2 promoter/terminator was then introduced into the E. coli/yeast
shuttle vector pYT (a gift from Dr. S. Hawkes, University of California, San
Francisco). The 13.2-kbp BamHI/SalI restriction fragment from pYT was ligated to
the 757-bp BamHI/XhoI restriction fragment from 43d2 to yield plasmid 101c, which
contains Leu and Ura markers for selection.

To complete construction of the expression vector, a 5.3-kbp NdeI/XbaI
restriction fragment containing the gene for 6-methylsalicylic acid synthase (6-MSAS)
from Penicillium patulum was obtained from demethylated plasmid pDB102
(Bedford, D., et al., J Bacteriology (1995) 177:4544-4548) and ligated into
NdeI/XbaI-restricted 43d2, yielding intermediate plasmid 71d. The 6.1-kbp
NotI/RsrII restriction fragment from 71d was ligated to the 12.6-kbp NotI/RsrII
restriction fragment from 101c to produce the expression vector 102d.


Example 2 

Expression of 6-MSAS in Saccharomyces cerevisiae
Competent Saccharomyces cerevisiae InvSc1 (MATa his3D1 leu2 trp1-289
ura3-52) (Invitrogen) was transformed with 102d, then plated on minimal agar plates
(1.7 g/L yeast nitrogen base without amino acids or ammonium sulfate (DIFCO),
5 g/L (NH4)2SO4, 20 g/L glucose, 20 g/L agar) containing amino acids for selection
based on uracil prototrophy. Transformants were picked and grown for 24 hours in
uracil-deficient minimal medium. Plasmid DNA was isolated from the transformants
and analyzed by restriction digestion analysis to confirm identity.

A successful transformant was used to inoculate 2 mL of uracil-deficient
minimal medium and was grown overnight at 30°C on an orbital shaker. A 100-uL
aliquot of this culture was used to inoculate 10 mL of YPD medium (Wobbe, C.R., in
Current Protocols in Molecular Biology, Supplement 34:13.0.1-13.13.9 (Wiley,
1996)) (10 g/L yeast extract, 20 g/L peptone, 20 g/L glucose), and the culture was
grown at 30°C on a shaker.

Cells were collected by centrifugation of 500 uL-aliquots of the culture taken
after 18 and 36 hours of growth and lysed by boiling in 50 uL of 2x SDS gel loading
buffer for 2 minutes.

The cell lysates were analyzed by loading onto 12% SDS-PAGE gels. A band
corresponding to the expected size of 6-MSAS was observed at ca. 190 kD.

Example 3

Construction of a Holo ACP Synthase Expression Vector
The Bacillus subtilis sfp gene encodes a holo ACP synthase, i.e., a
phosphopantothenoyl transferase, and is inserted into plasmid YepFLAG-1
(IBI/Kodak).

The 5.7-kbp PacI/NotI restriction fragment of YepFLAG-1 was ligated with a
synthetic polylinker to introduce the following restriction sites:

(PacI) - BamHI - NotI - NcoI - RsrII - XhoI - SalI - (NotI).

The original PacI and NotI ligation sites were destroyed in the ligation. The
resulting vector was cut with BamHI and SalI and was ligated to BamHI/XhoI-
digested 43d2 (see Example 1) to introduce the ADH2 promoter/terminator, thus
obtaining the plasmid 126b. The Bacillus subtilis sfp gene was amplified from the
plasmid pUC8-sfp (Nakano, M. et al. Mol Gen Genet (1992) 232:313-321) by PCR
using the primers:

forward: TAGACACATATGAAGATTTACGGAATTTATATG
reverse: TACATTCTAGAAATTATAAAAGCTCTTCG.

The forward primer introduces an NdeI restriction site (nucleotides 7-12) and
the reverse primer introduces an XbaI site (nucleotides 6-11).

The resulting PCR fragment was ligated into the NdeI and XbaI sites of 43d2
to produce plasmid 109c.

The 1.3-kbp BamHI/SalI restriction fragment of 109c was ligated to
BamHI/SalI-digested 126b to produce expression vector 128a which contains the sfp
gene under control of the ADH2 sequences and tryptophan prototrophy as selection
marker.


Example 4 

Production of 6-methylsalicylic Acid in Yeast 
Competent Saccharomyces cerevisiae InvSc1 cells were transformed with
102d (6-MSAS) and 128a (sfp holo ACP synthase). 128a was used in the first
transformation with selection for tryptophan prototrophy; a successful transformant
was then transformed with 102d, with selection for tryptophan and uracil prototrophy.
Transformants appeared after 48-72 hr at 30°C.

Single colonies of the 6-MSAS/sfp transformants were grown 24-48 hr at 30°C
in tryptophan- and uracil-deficient minimal medium, after which 100 ul was used to
inoculate 10 ml of YPD medium. Cultures were grown for 18 hr at 30°C in an orbital
shaker at 225 rpm. YPD medium (50 ml) was inoculated with 0.5 ml of the overnight
cultures and incubated at 30°C for 142 hr. One ml aliquots were removed
periodically and the cells were collected by centrifugation. The cells were suspended
in SDS-PAGE loading buffer, boiled for 2 min and subjected to SDS-PAGE to
determine the production of the PKS protein. The supernatants were analyzed for
6-methylsalicylic acid production by injection of 20 ul onto an HPLC (C18 reverse-
phase column, water/acetonitrile/acetic acid gradient, diode-array UV detection). The
LC parameters were as follows: Solvent A = 1% acetic acid in water; Solvent B = 1%
acetic acid in acetonitrile; gradient = 20% B to 80% B in 30 min then to 100% in 2
min; flow rate = 0.5 ml/min. The amount of 6-methylsalicylic acid was quantitated by
peak integration at 307 nm. A standard curve was generated using authentic
6-methylsalicylic acid (Seidel, J.L. et al., J Chem Ecology (1990) 16:1791-1816).
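
The LC gradient stated above can be written as a small table and interpolated. The sketch below is illustrative only. It assumes the ramps are linear and that the run starts at 20% B at time zero, neither of which is stated explicitly in the text, and the function names are hypothetical.

```python
# Gradient breakpoints from the text: (time in min, % solvent B).
GRADIENT = [(0.0, 20.0), (30.0, 80.0), (32.0, 100.0)]
FLOW_ML_PER_MIN = 0.5


def percent_b(t_min):
    """Linearly interpolate %B at t_min; hold the end values outside the table."""
    if t_min <= GRADIENT[0][0]:
        return GRADIENT[0][1]
    for (t0, b0), (t1, b1) in zip(GRADIENT, GRADIENT[1:]):
        if t_min <= t1:
            return b0 + (b1 - b0) * (t_min - t0) / (t1 - t0)
    return GRADIENT[-1][1]


def solvent_volumes_ml():
    """Integrate the piecewise-linear gradient; return (ml of A, ml of B) per run."""
    vol_b = 0.0
    for (t0, b0), (t1, b1) in zip(GRADIENT, GRADIENT[1:]):
        mean_fraction_b = (b0 + b1) / 2.0 / 100.0
        vol_b += mean_fraction_b * (t1 - t0) * FLOW_ML_PER_MIN
    total = (GRADIENT[-1][0] - GRADIENT[0][0]) * FLOW_ML_PER_MIN
    return total - vol_b, vol_b


vol_a, vol_b = solvent_volumes_ml()
print(percent_b(15.0), percent_b(31.0))   # 50.0 90.0
print(round(vol_a, 2), round(vol_b, 2))   # 7.6 8.4
```

Under these assumptions a single 32-minute run consumes 16 ml of mobile phase in total.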

The results of a typical experiment are shown in Figure 7. Yeast which
contained only the control plasmid 101c or control plasmid and the sfp expression
plasmid 128a produced no 6-MSA (trace b, d). Yeast containing only the 6-MSAS
expression vector 102d produced a barely detectable amount of 6-MSA (trace c).
Yeast containing both the 6-MSAS expression vector 102d and the sfp expression
vector 128a produced as much as 1.7 g/l of 6-MSA (trace a).

The kinetics for yeast growth and 6-MSA production for the transformant are
shown in Figure 8A. As shown, the open squares represent growth as measured by
OD600. The closed circles represent the production of 6-MSA in g/L. The production
of 6-MSA begins when glucose is depleted, consistent with derepression of the ADH2
promoter. A plateau was reached after about 60 hr of growth and remained constant
up to 150 hr.

For large-scale preparation of 6-MSA, a 500 ml yeast culture harboring the
two plasmids was grown for 120 hr and the cells were removed by centrifugation.
The supernatant broth (280 ml) was acidified with 28 ml glacial HOAc, then extracted
with 280 ml ethyl acetate. The organic extract was concentrated to dryness under
reduced pressure. The crude product was purified by crystallization from water and
the crystals were dried under vacuum over KOH. The identity of 6-MSA was
confirmed by NMR and mass spec. In the specific experiment described above, the
280 ml of cell-free yeast culture yielded 240 mg of 6-MSA as crystalline needles.
Shake flask cultures typically produced over 1 g/L of 6-methylsalicylic acid.
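
For comparison with the titres quoted in g/L, the isolated yield of the large-scale preparation can be converted to the same units. The short sketch below only does the unit conversion. The function name is illustrative, and the result is an isolated (post-crystallization) figure rather than a broth titre.

```python
def isolated_titre_g_per_l(mass_mg, volume_ml):
    """Convert an isolated mass (mg) recovered from a broth volume (ml) to g/L."""
    return (mass_mg / 1000.0) / (volume_ml / 1000.0)


# Figures from the large-scale preparation: 240 mg from 280 ml of cell-free broth.
titre = isolated_titre_g_per_l(mass_mg=240.0, volume_ml=280.0)
print(round(titre, 2))  # 0.86
```

An isolated yield of about 0.86 g/L is consistent with broth titres above 1 g/L once losses on extraction and crystallization are allowed for.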

Example 5 

Construction of the DEBS Module 6 KR-ACP-TE Expression Vector, Plasmid 104
The plasmid, 90, which contains a T5 promoter, 2 lac operators, and lacIq
was constructed by ligating a 1.1-kbp XhoI/XbaI fragment of pQE60 (Qiagen) to the
larger XhoI/XbaI fragment of pET22b(+) (Novagen). A PstI/EcoRI restriction
fragment containing the DNA encoding module 6 KR-ACP-TE was ligated into
plasmid 90 to give plasmid 104, an expression vector for this module.

Example 6

Phosphopantothenoylation of Module 6 KR-ACP-TE

A. In vivo:

The β-alanine auxotroph Escherichia coli SJ16 (E. coli Genetic Stock Center)
was cotransformed with 104 and a holo-ACP synthase expression plasmid containing
genes for either:

E. coli fatty acid synthase holo-ACPS (ACPS);
E. coli enterobactin synthetase holo-ACPS (EntD); or
Bacillus brevis gramicidin synthetase holo-ACPS (GsP).

Holo-ACPS expression plasmids were generous gifts of Dr. Daniel Santi,
UCSF (Ku, J., et al. Chemistry & Biology (1997) 4:203-207).

Each cotransformant was grown in minimal medium E (Vogel, H.J. et al., J
Biol Chem (1956) 218:97-106) supplemented with 0.001% thiamine, 0.01%
methionine, and 100 uM β-alanine at 37°C for 20 h. Cells were collected by
centrifugation and washed with 1 mL of growth medium without β-alanine. This
wash was repeated four times. Finally, the cells were incubated in 1 mL of growth
medium without β-alanine at 37°C for 6 h.

A 30-uL aliquot of the starved cells was added to 1 mL of growth medium
supplemented with 0.52 uM [3H]-β-alanine (1 uCi, American Radiolabeled
Chemicals, Inc.). After 6 h at 37°C, the cells were induced by addition of IPTG to 1
mM, kept for an additional 3 h at 37°C, and centrifuged. The cell pellet was boiled in
SDS gel loading buffer, then analyzed on a 10% SDS-PAGE gel. The gel was stained
with Coomassie Blue, photographed, soaked in Amplify (Amersham), dried, and
autoradiographed using Kodak Bio-MAX film for 2 days.

The module 6 KR-ACP-TE fragment of DEBS was efficiently labeled upon
coexpression with GsP and with EntD, while no labeling was observed upon
coexpression with ACPS. The inability of ACPS to activate the DEBS fragment is
expected based on the known inactivity and lack of phosphopantothenoylation of the
DEBS protein when expressed in E. coli (Roberts et al. Eur J Biochem (1993)
214).

B. In vitro:

The module 6 KR-ACP-TE fragment of DEBS was purified
from E. coli transformed with p104 using a Ni2+ affinity column following
manufacturer's directions (Invitrogen). Purified surfactin synthetase holo-ACPS (sfp)
from Bacillus subtilis was a gift of Dr. Christopher Walsh (Harvard Medical School).
Labeled 3H-coenzyme A was a gift of Dr. Daniel Santi (UCSF).

All assays were performed in 10 mM MgCl2, 50 mM Tris-HCl (pH 8.8), in a
total volume of 100 uL, and contained 40,000 cpm of 3H-coenzyme A and 0.39 uM
sfp. A positive control contained 1.8 uM PheAT domain from gramicidin synthetase
(Dr. Daniel Santi, UCSF) which is normally pantothenoylated by sfp. Reactions were
kept 12 h at 37°C then boiled in SDS gel loading buffer and analyzed on a 10% SDS-
PAGE gel. The gel was stained with Coomassie Blue, photographed, soaked in
Amplify (Amersham), dried, and autoradiographed using Kodak Bio-MAX film for 2
days.

Both PheAT and the module 6 KR-ACP-TE fragment of DEBS were
efficiently labeled by sfp.


Example 7 

Production of 6-methylsalicylic acid in Escherichia coli
The plasmid 90 (see Example 5) was converted to p95 by inserting a linker
between the EcoRI/HindIII sites in plasmid 90 so as to introduce restriction sites NdeI
and SpeI adjacent to the T5 promoter. The 6-MSAS expression vector, 109, was
constructed by ligating a NdeI/XbaI fragment containing the 6-MSAS open reading
frame (Pfeifer, E. et al. Biochemistry (1995) 34:7450-7459) with the large NdeI/SpeI
fragment of 95 leaving about 1 kbp of the linker between the SpeI and HindIII sites of
the vector.

The sfp expression vector, 108, was made by ligating a 1.1-kbp EcoRI/PvuII
restriction fragment of pUC8-sfp (see Example 3) to pACYC-184 (New England
Biolabs) cut with EcoRV after fill-in of the EcoRI site by DNA polymerase I. The
orientation of the sfp gene with respect to the promoter was verified by HindIII
digestion.

Plasmids 108 and 109 were cotransformed into E. coli C2453, and
transformants were selected by chloramphenicol and ampicillin resistance. A single
colony containing both plasmids was grown in ATCC medium 765 supplemented
with 10% glycerol at 37°C to a density of 1.0 OD600 then cooled to 30°C and induced
by addition of 0.5 mM IPTG. Cell growth was continued for 36 hr at 30°C. Protein
expression was checked by 10% SDS-polyacrylamide gel. The formation of
6-methylsalicylic acid was followed by HPLC analysis of the culture broth.

The concentration of 6-MSA was estimated as described in Example 4 from a
plot of concentration vs integrated area of corresponding HPLC peak using an
authentic sample. The identity of the product was confirmed by LC-mass
spectroscopy, which revealed [M+H]+ = 153, with a major fragment at m/z = 135
corresponding to loss of H2O. Under these conditions, the culture produced 50 mg/L
of 6-methylsalicylic acid.
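
The reported ions can be checked against the molecular formula of 6-methylsalicylic acid, C8H8O3. The sketch below is a simple monoisotopic mass calculation offered as a plausibility check. The formula and isotope masses are standard values rather than data from the source, and the names used are illustrative.

```python
# Monoisotopic masses (u) of the most abundant isotopes.
MONO = {"C": 12.0, "H": 1.007825, "O": 15.994915}
PROTON = 1.007276  # mass added on protonation


def mono_mass(formula):
    """Sum the monoisotopic masses for a formula given as {element: count}."""
    return sum(MONO[element] * count for element, count in formula.items())


MSA = {"C": 8, "H": 8, "O": 3}  # 6-methylsalicylic acid
WATER = {"H": 2, "O": 1}

mh = mono_mass(MSA) + PROTON         # protonated molecule [M+H]+
fragment = mh - mono_mass(WATER)     # [M+H - H2O]+

print(round(mh, 3), round(fragment, 3))  # 153.055 135.044
```

Both values round to the nominal masses reported in the text, 153 and 135.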

The production of 6-MSA in E. coli was dependent on the presence of the
plasmid encoding the sfp protein. E. coli transformed with only the 6-MSAS
expression vector, 109, when induced by IPTG followed by incubation at 37°C for
4 hr, showed production of the approximately 190 kD 6-MSAS at about 5% of total
protein. However, most of the protein was insoluble and 6-MSA was not detected in
the medium. When the β-alanine auxotroph E. coli SJ16 containing the 6-MSAS
expression vector 109 was incubated with labeled β-alanine before and after
induction, no radioactivity was found in the 6-MSAS band on SDS-PAGE; thus, it
appears the 6-MSAS was not modified with the phosphopantetheinyl cofactor by
endogenous transferase. In a similar experiment involving E. coli SJ16 cotransformed
with both plasmid 108 and 109, a detectable amount of radioactivity was found in the
190 kD 6-MSAS band; however, no 6-MSA was detected under these conditions.
However, when the temperature of incubation was lowered to promote proper protein
folding and glycerol was added to the medium to increase levels of intracellular
malonyl CoA substrate, production of 6-MSA was improved. Thus, when cells were
grown at 30°C in the absence of glycerol or at 37°C in the presence of 10% glycerol,
no 6-MSA was produced. However, when grown as described above at 30°C in the
presence of 10% glycerol, 6-MSA was produced up to about 75 mg/L after 24 hr of
incubation. The kinetics of production are shown in Figure 8B.

Example 8

Production of 6-methylsalicylic acid in Saccharomyces cerevisiae
using a PKS-holo ACP synthase fusion protein

A fusion protein between the Penicillium patulum 6-methylsalicylic acid
synthase (6-MSAS) and the Bacillus subtilis surfactin holo ACP synthase (sfp) was
made as follows:

A 5.3-kbp NdeI/HindIII fragment containing the 6-MSAS gene (see Example
1) was ligated with a 708-bp HindIII/XbaI fragment containing the sfp gene (see
Example 3) and with NdeI/XbaI-restricted 43d2 (see Example 1) to produce
intermediate plasmid 69. A ca. 6-kbp NotI/RsrII restriction fragment from 69 was
ligated with NotI/RsrII-restricted 101c (see Example 1) to yield the yeast expression
vector 26a1 (see Example 1). This vector contains the 6-MSAS/sfp fusion gene
between the ADH2 promoter/terminator pair.

The resulting fusion protein consisted of connecting the C-terminal lysine of
6-MSAS with the N-terminal methionine of sfp using an (alanine)3 linker, such that
the DNA sequence of the gene in the region of the fusion was:

5'-AAGCTTGCCAAA-GCCGCCGCC-ATGAAGATTTAC-3'

where the lysine (AAA) and methionine (ATG) codons flank the three alanine
codons of the linker.
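
The reading frame across the fusion junction can be confirmed by translating the sequence given above. The sketch below builds the standard genetic code table and translates the junction in frame from its first base. It is a minimal illustration, and the helper names are not from the source.

```python
from itertools import product

# Standard genetic code, codons ordered with T, C, A, G at each position.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}

# Junction sequence from Example 8; hyphens only mark the linker boundaries.
JUNCTION = "AAGCTTGCCAAA-GCCGCCGCC-ATGAAGATTTAC"


def translate(dna):
    """Translate a DNA string in frame 1, ignoring hyphens used as visual separators."""
    seq = dna.replace("-", "").upper()
    return "".join(CODON_TABLE[seq[i:i + 3]] for i in range(0, len(seq) - 2, 3))


peptide = translate(JUNCTION)
print(peptide)  # KLAKAAAMKIY
```

The translation reads ...K-L-A-K, then A-A-A, then M-K-I-Y..., so the 6-MSAS C-terminal lysine, the three-alanine linker and the sfp N-terminal methionine are all in one frame. The leading AAGCTT is also the HindIII site used to join the two fragments.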

Transformation of S. cerevisiae InvSc1 with 26a1 and culturing as described
in Example 3 resulted in production of 6-methylsalicylic acid at a level comparable
with that resulting from expression of 6-MSAS and sfp as separate genes. The fusion
protein thus combines the enzymatic activities of 6-MSAS and of sfp, self-
phosphopantothenoylates, and produces polyketide product.

This is especially useful for transformation of hosts where the number of
plasmid replicons useable for expression vectors is limited, where polycistronic
messages are not properly processed, or where transformation with multiple vectors is
difficult and/or time-consuming.

Example 9

Production of 6-deoxyerythronolide B by mixed chromosomal/plasmid expression
systems in Streptomyces lividans using chromosomal integration
To demonstrate the feasibility of dividing the three DEBS genes between
chromosomal and plasmid expression systems, two experiments were performed. In
both experiments, the integrating vector pSET152 (Bierman, M., et al., Gene (1992)
116:43-49) was used to place one gene of the DEBS gene cluster under control of the
actinorhodin promoter onto the Streptomyces chromosome at the phage attachment
site. The remaining genes were placed onto the replicating plasmid, pRM5
(McDaniel et al., Science (1993) 262:1546-1550), also under control of the
actinorhodin promoter.

A. The eryAIII gene (encoding modules 5 and 6 and the thioesterase of
DEBS) under control of the actinorhodin promoter was cloned into pSET152. The
resulting vector was used to transform S. lividans K4-114, a strain in which the
actinorhodin gene has been deleted by homologous recombination by standard
methods (US patent application 08/238,811 incorporated herein by reference).
Apramycin-resistant transformants were selected.

An expression plasmid was constructed by cloning the eryAI and eryAII genes
(containing modules 1+2 and 3+4, respectively) into the PacI/EcoRI sites of pRM5 so
that the two genes were under the control of the actinorhodin promoter. This plasmid
was used to transform protoplasts of the S. lividans clone containing the integrated
eryAIII gene, and colonies resistant to both thiostrepton and apramycin were selected.

B. Alternatively, the actinorhodin promoter and the eryAI gene were
cloned into pSET152 and subsequently integrated into the S. lividans chromosome.
The eryAII and eryAIII genes were cloned into pRM5 behind the actinorhodin
promoter, and this plasmid was used to transform the S. lividans strain containing the
integrated eryAI gene.

Randomly selected colonies of the above organisms containing mixed
chromosomal-plasmid expression systems were cultured on R2YE medium over
XAD-16 resin, and ethanol extracts of the resin collected after 7 days were analyzed
for production of 6-deoxyerythronolide B by LC/mass spectrometry. Cultures from
both experiments A and B produced 6-deoxyerythronolide B at levels of 15-20 mg/L,
comparable to that found in extracts of cultures of S. lividans containing pCK7, a
replicating plasmid containing all three eryA genes under control of the actinorhodin
promoter.

Example 10

Production of 6-deoxyerythronolide B by mixed chromosomal/plasmid expression
systems in Streptomyces lividans
An alternative method for constructing a mixed chromosomal-plasmid
expression system for multi-gene PKSs also achieves simultaneous creation of a clean
host for polyketide production. A suitable expression host, which normally produces
a polyketide product, has its chromosomal PKS genes replaced by a subset of the
foreign PKS genes through homologous recombination. This accomplishes the
desired chromosomal integration of the foreign PKS genes while simultaneously
eliminating interference from and competition by the native PKS. The example is
readily illustrated for S. coelicolor and S. lividans, both of which make the blue
polyketide actinorhodin.

A method by which the entire actinorhodin gene cluster is removed from these
organisms and replaced with an antibiotic marker through homologous recombination
has been described (US patent application 08/238,811). This method is adapted as
follows: The recombination vector consists of any vector capable of generating
single-stranded DNA (e.g., pBlueScript) containing the following elements: 1) a DNA
sequence homologous to the 5' 1-kbp end of the act cluster; 2) a resistance marker
(e.g., hygromycin or thiostrepton); 3) the actII-orf4 activator gene; 4) the act
promoter; 5) one or more genes of the foreign PKS; and 6) a DNA sequence
homologous to the 3' 1-kbp end of the act cluster. Transformation of S. coelicolor or
S. lividans with the recombination vector followed by selection for hygromycin
resistance and screening for loss of blue color provides a host lacking the actinorhodin
gene cluster and containing a chromosomal copy of the foreign PKS genes along with
the needed actinorhodin control elements. This host is subsequently transformed by
replicating vectors (e.g., SCP2*-based plasmids) and/or with integrating phage
vectors (e.g., pSET152) containing other genes of the foreign PKS to complete the set
of PKS genes and produce polyketide product.

Example 11

Construction of Yeast Vectors for Expression of an Aromatic Minimal PKS

The genes encoding the KS/AT bifunctional protein and the CLF gene of the
actinorhodin PKS (diagrammed in Figure 5) are amplified and tailored by PCR and
cloned into the yeast expression vector pYEUra3 (Clontech) under control of the Gal1
and Gal10 promoters respectively. The ACP gene is amplified and cloned together
with the holo-ACP synthase gene, if necessary, into a plasmid derived from pYEUra3
by replacement of the Ura3 gene with the Leu2-d gene. Expression is also driven by
the Gal1 and Gal10 promoters respectively. Yeast strain BJ2168 is cotransformed
with these plasmids and also with plasmid 128a (see Example 3) and transformants
selected on uracil- and leucine-deficient plates by standard methods. Expression is
induced by growth in 2% galactose according to the manufacturer's instructions. The
polyketide produced by this synthase system is predicted to be

[Structural formula not reproduced. Recoverable labels: H3C, OH, HO, O.]

Example 12

Construction of Yeast Vectors for Expression of Modular Synthase Activities

Two vectors are constructed. One contains the putative two-module system of
spiramycin under control of the ADH-2 promoter and colinear with the thioesterase
domain of the erythromycin PKS. The coding sequence construct is engineered to be
flanked by an NdeI site at the initiation codon and an NsiI site following the
termination codon; this construct is cloned using synthetic oligonucleotide linkers into
pYT.

In the second vector, the analogous structure from the erythromycin PKS
system flanked by NdeI and NsiI sites as described by Kao, C. et al. J Am Chem Soc
(1995) 117:9105-9106 is cloned into pYT so as to be placed under control of the
ADH-2 promoter. Figure 9 shows the relevant expression portion of these vectors and
the expected polyketide products.

Claims

1. A modified recombinant host cell, which, in unmodified form, does not
produce polyketides, which cell is modified to contain an expression system for a
minimal polyketide synthase (PKS) and an expression system for a holo ACP
synthase,

said minimal PKS comprising a ketosynthase/acyl transferase (KS/AT)
catalytic region, a chain-length factor (CLF) catalytic region and an acyl carrier
protein (ACP) activity for an aromatic PKS, or

said minimal PKS comprising a KS catalytic region, an AT catalytic region,
and an ACP activity for a modular PKS or a fungal PKS.

2. The modified cell of claim 1 which is E. coli or yeast.

3. The modified cell of claim 1 wherein said PKS is the synthase for
6-methyl salicylic acid.

4. The modified cell of claim 1 wherein the nucleotide sequence encoding
said holo ACP synthase and the nucleotide sequence encoding at least a portion of
said minimal PKS are fused so as to encode a fusion protein.

5. The modified cell of claim 1 wherein said expression system for said
minimal PKS and said expression system for said holo ACP synthase are present on
separate vectors.

6. The modified cell of claim 1 wherein at least one of said expression
systems is integrated into the host cell chromosome.

7. A method to produce a polyketide which method comprises culturing
the cells of claim 1 under conditions wherein said expression systems produce the
encoded proteins and wherein said polyketide is synthesized.

z j 8. A recombinant host cell modified to contain either 

a) at least two vectors, said first vector containing a first selectable 
marker and a first expression system and said second vector containing a second 
selectable marker and a second expression system and optionally additional vectors 
containing additional selectable markers and expression systems wherein said 

1 0 expression systems contained on said vectors are effective to produce at least a 
minimal polyketide synthase (PKS); or 

b) at least one vector and a °\"dified chromosome, said one vector 
containing a first selectable marker and a first expression system and said modified 
chromosome containing a second expression system and optionally additional vectors 

1 5 containing additional selectable markers and expression systems wherein said 

expression systems contained on said vectors in combination with said expression 
system on said chromosome are effective to produce at least a minimal PKS; 

said minimal PKS comprising a ketosynthase/acyl transferase (KS/AT) 
catalytic region, a chain-length factor (CLF) catalytic region and an acyl carrier 
2 r > protein (ACP) activity for an aromatic PKS, or 

said minimal PKS comprising a KS catalytic region, an AT catalytic region, 
and an ACP activity for a modular PKS. 

9. The cell of claim 8 which is a yeast cell, an E. coli cell, an 

2 5 actinomycete cell or a plant cell, 

10. The cell of claim 8 which further contains an expression system for a 
cell-based detection system for a functional polyketide. 
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1 1 The cell of claim 8 which produces at least a minimal aromatic PKS 
and which contains: 

(a) a first vector comprising a first selectable marker and an expression 
system comprising a nucleotide sequence encoding a KS AT catalytic region operably 
linked to a promoter operable in said cell: 

(b) a second vector comprising a second selectable marker and an 
expression system comprising a nucleotide sequence encoding a CLF catalytic region 
operably linked to a promoter operable m said cell; and 

(c) a third vector containing a third selectable marker and an expression 
1 0 system which comprises a nucleotide sequence encoding an ACP activity operably 

linked to a promoter operable in said cell 

12. The cell of claim 8 which produces at least a minimal modular PKS 
and which contains: 

(a) a first vector containing a first selectable marker and an expression 
system for at least one module of a polyketide synthase (PKS) operably linked to a 
promoter operable in said cell; and 

(b) a second vector containing a second selectable marker and a nucleotide 
sequence encoding at least a second module of a polyketide synthase operably linked 

to a promoter operable in said cell. 

13. The cell of claim 12 wherein said first and second module are derived 
from different polyketide synthases. 

14. The cell of claim 13 wherein said nucleotide sequence encoding at 
least one module further contains a nucleotide sequence encoding a KR activity; or 

wherein the nucleotide sequence encoding at least one module encodes a KR 
and DH activity; or 

wherein said nucleotide sequence encoding at least one module encodes a KR, 
DH and ER activity; and/or 

wherein said nucleotide sequence encoding at least one module encodes a 
thioesterase (TE) activity. 


15. A method to produce a polyketide which method comprises culturing 
the cells of claim 8 under conditions wherein said expression systems produce the 
encoded proteins and wherein said polyketide is synthesized. 

16. The cell of claim 8 which is further modified to contain a recombinant 
expression system for a holo ACP synthase. 

17. A method to produce a polyketide which method comprises culturing 
the cells of claim 16 under conditions wherein said expression systems produce the 
encoded proteins and wherein said polyketide is synthesized. 



18. A library of polyketide synthases (PKS) or synthesized polyketides 
which comprises a panel of individual colonies, each colony containing either 

a) at least two vectors; said first vector containing a first selectable 
marker and a first expression system and said second vector containing a second 
selectable marker and a second expression system and optionally additional vectors 
containing additional selectable markers and expression systems wherein said 
expression systems contained on said vectors are effective to produce at least a 
minimal polyketide synthase (PKS), or 

b) at least one vector and a modified chromosome, said one vector 
containing a first selectable marker and a first expression system and said modified 
chromosome containing a second expression system and optionally additional vectors 
containing additional selectable markers and expression systems wherein said 
expression systems contained on said vectors in combination with said expression 
system on said chromosome are effective to produce at least a minimal PKS, 
said minimal PKS comprising a ketosynthase/acyl transferase (KS/AT) 
catalytic region, a chain-length factor (CLF) catalytic region and an acyl carrier 
protein (ACP) activity for an aromatic PKS, and 

said minimal PKS comprising a KS catalytic region, an AT catalytic region, 
and an ACP activity for a modular PKS, 

wherein the combination of vectors or of vector(s) and modified chromosome 
is different in each colony. 

19. The library of claim 18 wherein said colonies are colonies of yeast, 
E. coli, actinomycetes or plant cells. 

20. The library of claim 18 wherein each colony further contains an 
expression system for a cell-based detection system for a functional polyketide. 

21. The library of claim 18 wherein the PKS are aromatic PKS and each 
colony contains: 

(a) a first vector comprising a first selectable marker and an expression 
system comprising a nucleotide sequence encoding a KS/AT catalytic region operably 
linked to a promoter operable in said cell; 

(b) a second vector comprising a second selectable marker and an 
expression system comprising a nucleotide sequence encoding a CLF catalytic 
domain operably linked to a promoter operable in said cell; and 

(c) a third vector containing a third selectable marker and an expression 
system which comprises a nucleotide sequence encoding an ACP activity operably 
linked to a promoter operable in said cell; 

wherein said combination of first, second and third vectors is different in each 

colony. 
22. The library of claim 18 wherein the PKS are modular PKS wherein 
each colony contains 

a first vector containing a first selectable marker and an expression system for at least 
one module of a PKS operably linked to a promoter operable in said cell; and 

a second vector containing a second selectable marker and a nucleotide 
sequence encoding at least a second module of a polyketide synthase operably linked 
to a promoter operable in said cell; 

wherein said combination of first and second vectors is different in each 

colony. 

23. The library of claim 22 wherein said nucleotide sequence encoding at 
least one module further contains a nucleotide sequence encoding a KR activity; or 

wherein the nucleotide sequence encoding at least one module encodes a KR 
and DH activity; or 

wherein said nucleotide sequence encoding at least one module encodes a KR, 
DH and ER activity; and/or 

wherein said nucleotide sequence encoding at least one module encodes a 
thioesterase (TE) activity. 

24. The library of claim 18 wherein each colony further contains a 
recombinant expression system for a holo ACP synthase. 

25. A method to produce a library of polyketides which method comprises 
culturing the cells of claim 18 under conditions wherein said expression systems 
produce the encoded proteins and wherein said polyketide is synthesized. 

26. A method to produce a library of polyketides which method comprises 
culturing the cells of claim 24 under conditions wherein said expression systems 
produce the encoded proteins and wherein said polyketide is synthesized. 
27. A method to identify a polyketide that binds a target receptor which 
method comprises contacting said receptor with each member of the library of claim 
18 under conditions wherein binding to said receptor can be detected; and 

detecting the presence or absence of binding to said receptor with respect to 
each member, whereby 

a member that binds to a receptor is identified. 

28. A method to identify a polyketide that binds a target receptor which 
method comprises contacting said receptor with each member of the library of claim 
24 under conditions wherein binding to said receptor can be detected; and 

detecting the presence or absence of binding to said receptor with respect to 
each member, whereby 

a member that binds to a receptor is identified. 


29. A method to identify a polyketide functional in a cell-based detection 
system which method comprises assessing each member of the library of claim 18 

for the presence or absence of signal in said cell-based detection system 

whereby a functional polyketide is identified. 


30. A vector adapted for expression in yeast which vector contains a 
selectable marker operable in yeast, and an expression system which comprises the 
coding region of at least one functional polyketide synthase catalytic activity operably 
linked to a promoter operable in yeast. 



31. A yeast cell modified to contain the vector of claim 30. 

32. The yeast cell of claim 31 which further contains a recombinant 
expression system for a holo ACP synthase. 



33. A method to produce a polyketide synthase activity which method 
comprises culturing the yeast cell of claim 31 under conditions wherein expression is 
favored. 



34. A method to produce a polyketide synthase activity which method 
comprises culturing the yeast cell of claim 32 under conditions wherein expression is 
favored. 



35. A vector adapted for expression in E. coli which vector contains a 
selectable marker operable in E. coli, and an expression system which comprises the 
coding region of at least one functional polyketide synthase catalytic activity operably 
linked to a promoter operable in E. coli. 



36. An E. coli cell modified to contain the vector of claim 35. 

37. The E. coli cell of claim 36 which further contains a recombinant 
expression system for a holo ACP synthase. 



38. A method to produce a polyketide synthase activity which method 
comprises culturing the E. coli cell of claim 36 under conditions wherein expression is 
favored. 






39. A method to produce a polyketide synthase activity which method 
comprises culturing the E. coli cell of claim 37 under conditions wherein expression is 
favored. 